1. Introduction

Let $\mathcal{H}$ be a Hilbert space. We consider a quantum channel [?] with a finite input alphabet $\{1,\dots,a\}$ and with pure signal states $S_i = |\psi_i\rangle\langle\psi_i|$, $i = 1,\dots,a$. The compound channel of length $n$ acts in the $n$-fold tensor product of the space $\mathcal{H}$, i.e. $\mathcal{H}_n = \mathcal{H}\otimes\dots\otimes\mathcal{H}$. An input block (codeword) $u = (i_1,\dots,i_n)$, $i_k\in\{1,\dots,a\}$, means using the compound state $\psi_u = \psi_{i_1}\otimes\dots\otimes\psi_{i_n}\in\mathcal{H}_n$ and the corresponding density operator $S_u = |\psi_u\rangle\langle\psi_u|$ in the space $\mathcal{H}_n$. A code $(\mathcal{C},X)$ of cardinality $M$ in $\mathcal{H}_n$ is a collection of $M$ pairs $(u^1,X_1),\dots,(u^M,X_M)$, where $X=\{X_1,\dots,X_M,X_{M+1}\}$ is a quantum decision rule, i.e. some resolution of identity in $\mathcal{H}_n$ [?]. The conditional probability $P(u^i|u^k)$ to make a decision in favor of message $u^i$ provided that message $u^k$ was transmitted is given by

\[ P(u^i|u^k) = \mathrm{Tr}\, S_{u^k} X_i = \langle\psi_{u^k}|X_i\psi_{u^k}\rangle . \]

In particular, the probability of making a wrong decision when the message $u^k$ is transmitted is

\[ 1 - \mathrm{Tr}\, S_{u^k} X_k = 1 - \langle\psi_{u^k}|X_k\psi_{u^k}\rangle . \]

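For a given code $(\mathcal{C},X)$ of cardinality $M$ we consider the following two error probabilities:

\[ \lambda_{\max}(\mathcal{C},X) = \max_{1\le k\le M}\left[1 - \langle\psi_{u^k}|X_k\psi_{u^k}\rangle\right], \]

\[ \bar\lambda(\mathcal{C},X) = \frac{1}{M}\sum_{k=1}^{M}\left[1 - \langle\psi_{u^k}|X_k\psi_{u^k}\rangle\right]. \]

We shall denote by $P_e(M,n)$ either of the following two error probabilities:

\[ \lambda_{\max}(M,n) = \inf_{\mathcal{C},X}\lambda_{\max}(\mathcal{C},X), \qquad \bar\lambda(M,n) = \inf_{\mathcal{C},X}\bar\lambda(\mathcal{C},X). \]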

It is well known in classical information theory [1] (Corollary 2 to Theorem 5.6.2) that $\lambda_{\max}(M,n)$ and $\bar\lambda(M,n)$ are essentially equivalent to each other. This remark obviously remains valid for quantum channels as well.

The Shannon capacity of a quantum channel was naturally defined in [?] as the number $C$ such that $P_e(e^{nR},n)$ tends to zero as $n\to\infty$ for any $0<R<C$ and does not tend to zero if $R>C$. Moreover, if $R<C$ then $P_e(e^{nR},n)$ tends to zero exponentially with $n$, and we are interested in the logarithmic rate of convergence given by the reliability function

\[ E(R) = \limsup_{n\to\infty}\frac{1}{n}\ln\frac{1}{P_e(e^{nR},n)}, \qquad 0<R<C. \tag{1} \]

Our main results are bounds for $E(R)$ reminiscent of the corresponding bounds in classical information theory. It is also remarkable that a number of tricks from classical information theory work quite well for quantum channels.

Let $\pi=\{\pi_i\}$ be a probability distribution on the input alphabet $\{1,\dots,a\}$; we denote $S_\pi=\sum_{i=1}^{a}\pi_i S_i$. Let $\lambda_j$, $j=1,\dots,d$, where $d$ is the dimension of $\mathcal{H}$, be the eigenvalues of the operator $S_\pi$ (obviously, they form a probability distribution). Then

\[ H(S_\pi) = -\mathrm{Tr}\, S_\pi\ln S_\pi = -\sum_{j=1}^{d}\lambda_j\ln\lambda_j \tag{2} \]

is the quantum entropy of the density operator $S_\pi$. The upper bound for the capacity

\[ C \le \max_\pi H(S_\pi) \]

follows directly from the entropy bound [?]. Recently, in [?] the converse inequality $C\ge\max_\pi H(S_\pi)$ was established, implying the formula

\[ C = \max_\pi H(S_\pi). \tag{3} \]

(This result was generalized to arbitrary signal states in [?].) The proof in [?] was based on the notion of typical subspace introduced in [?], [10]. A corollary of our estimates for the reliability function is an alternative approach to the converse inequality which makes no use of the notion of typical subspace.
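As a small numerical illustration of the entropy formula (2), the following Python sketch evaluates the quantum entropy from a list of eigenvalues. The function name `quantum_entropy` and the sample spectra are illustrative assumptions and do not come from the original text.

```python
import math

def quantum_entropy(eigenvalues):
    """Return H(S) = -sum_j lambda_j * ln(lambda_j), with the convention 0 ln 0 = 0.

    `eigenvalues` is assumed to be the spectrum of a density operator,
    i.e. nonnegative numbers summing to one (this is not checked here).
    """
    return -sum(x * math.log(x) for x in eigenvalues if x > 0.0)

# Hypothetical spectra, chosen only for illustration.
h_uniform = quantum_entropy([0.5, 0.5])  # equiprobable mixture of two orthogonal pure states
h_pure = quantum_entropy([1.0, 0.0])     # a single pure state
```

For the equiprobable mixture of two orthogonal states the entropy is $\ln 2$, and for a pure state it is zero, in line with (2).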



2. The random coding lower bound 

Let $\bar\lambda(u^1,\dots,u^M)$ be the average error probability corresponding to codewords $u^1,\dots,u^M$ of length $n$ over the input alphabet, minimized over all quantum decision rules. Let $\pi=\{\pi_i\}$ be a probability distribution on the input alphabet $\{1,\dots,a\}$ and assume that the words are chosen at random, independently, and with the probability distribution

\[ \mathsf{P}\{u=(i_1,\dots,i_n)\} = \pi_{i_1}\cdot\ldots\cdot\pi_{i_n} \tag{4} \]

for each word.

Proposition 1. For all $M$, $n$ and $0\le s\le 1$

\[ \mathsf{E}\,\bar\lambda(u^1,\dots,u^M) \le 2(M-1)^s\left[\mathrm{Tr}\,S_\pi^{1+s}\right]^n, \tag{5} \]

where

\[ \mathrm{Tr}\,S_\pi^{1+s} = \sum_{j=1}^{d}\lambda_j^{1+s}. \]

Proof. (The authors are grateful to T. Ogawa for pointing out an error in the earlier version of the proof.) Let us for a while restrict to the subspace of $\mathcal{H}$ generated by the signal vectors $\psi_i$, $i=1,\dots,a$, and consider the Gram matrix $\Gamma(u^1,\dots,u^M)=[\langle\psi_{u^i}|\psi_{u^j}\rangle]$ and the Gram operator $G(u^1,\dots,u^M)=\sum_{k=1}^{M}|\psi_{u^k}\rangle\langle\psi_{u^k}|$. This operator has the matrix $\Gamma(u^1,\dots,u^M)$ in the (possibly overcomplete) basis

\[ \hat\psi_k = G^{-1/2}(u^1,\dots,u^M)\,\psi_{u^k}, \qquad k=1,\dots,M. \]

In [?] it was shown that using the resolution of identity of the form $X_k=|\hat\psi_k\rangle\langle\hat\psi_k|$ we can upper-bound the average error probability as

\[ \bar\lambda(u^1,\dots,u^M) \le \frac{2}{M}\,\mathrm{Sp}\left(E-\Gamma^{1/2}(u^1,\dots,u^M)\right), \tag{6} \]

where $E$ is the unit $M\times M$ matrix and $\mathrm{Sp}$ is the trace of an $M\times M$ matrix.
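Indeed, for this decision rule

\[ \bar\lambda(u^1,\dots,u^M) \le \frac{1}{M}\sum_{k=1}^{M}\left[1-|\langle\psi_{u^k}|\hat\psi_k\rangle|^2\right] \le \frac{2}{M}\sum_{k=1}^{M}\left[1-|\langle\psi_{u^k}|\hat\psi_k\rangle|\right] = \frac{2}{M}\sum_{k=1}^{M}\left(1-\langle\psi_{u^k}|G^{-1/2}(u^1,\dots,u^M)\psi_{u^k}\rangle\right), \]

which is (6), since $\langle\psi_{u^k}|G^{-1/2}\psi_{u^k}\rangle$ is the $k$-th diagonal element of $\Gamma^{1/2}$.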

The first step of our argument is to remark that

\[ \frac{2}{M}\,\mathrm{Sp}\left(E-\Gamma^{1/2}(u^1,\dots,u^M)\right) = \frac{2}{M}\left(M-\mathrm{Tr}\,G^{1/2}(u^1,\dots,u^M)\right). \tag{7} \]

In what follows we shall omit $u^1,\dots,u^M$ from the notation. Consider the two operator inequalities

\[ -2G^{1/2} \le -2G + 2G, \]
\[ -2G^{1/2} \le -2G + (G^2-G). \]

The first one is obvious, while the second follows from the inequality

\[ -2x^{1/2} = 2(1-x^{1/2}) - 2 = (1-x^{1/2})^2 - 1 - x \le (1-x)^2 - 1 - x = x^2-3x, \]

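valid for $x\ge 0$. Taking the expectation with respect to the probability distribution (4), we get

\[ -2\,\mathsf{E}G^{1/2} \le -2\,\mathsf{E}G + \begin{cases} 2\,\mathsf{E}G, \\ \mathsf{E}(G^2-G). \end{cases} \]

Now

\[ \mathsf{E}G = \mathsf{E}\sum_{k=1}^{M}|\psi_{u^k}\rangle\langle\psi_{u^k}| = M\,\mathsf{E}|\psi_u\rangle\langle\psi_u| = M S_\pi^{\otimes n}, \]

\[ \mathsf{E}(G^2-G) = \mathsf{E}\sum_{k=1}^{M}\sum_{l=1}^{M}|\psi_{u^k}\rangle\langle\psi_{u^k}|\psi_{u^l}\rangle\langle\psi_{u^l}| - \mathsf{E}\sum_{k=1}^{M}|\psi_{u^k}\rangle\langle\psi_{u^k}| = \mathsf{E}\sum_{k\ne l}|\psi_{u^k}\rangle\langle\psi_{u^k}|\psi_{u^l}\rangle\langle\psi_{u^l}| = M(M-1)\left[S_\pi^{\otimes n}\right]^2. \]

Let $\{e_J\}$ be the orthonormal basis of eigenvectors, and $\lambda_J$ the corresponding eigenvalues, of the operator $S_\pi^{\otimes n}$. Then

\[ -2\langle e_J|\mathsf{E}G^{1/2}|e_J\rangle \le -2M\lambda_J + M\lambda_J\min\left(2,(M-1)\lambda_J\right). \]

Using the inequality $\min\{a,b\}\le a^s b^{1-s}$, $0\le s\le 1$, we get

\[ \min\left(2,(M-1)\lambda_J\right) \le 2(M-1)^s\lambda_J^{s}, \qquad 0\le s\le 1. \]

Summing with respect to $J$ and dividing by $M$, we get from (6), (7)

\[ \mathsf{E}\,\bar\lambda(u^1,\dots,u^M) \le 2(M-1)^s\sum_J\lambda_J^{1+s} = 2(M-1)^s\left[\mathrm{Tr}\,S_\pi^{1+s}\right]^n, \qquad 0\le s\le 1. \qquad\square \]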
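The proof rests on two elementary scalar inequalities: $-2\sqrt{x}\le x^2-3x$ for $x\ge 0$, and $\min\{a,b\}\le a^s b^{1-s}$ for $0\le s\le 1$. The following Python sketch checks both on a finite grid. It is only a numerical sanity check, not a proof; the function names and the grid are illustrative assumptions.

```python
import math

def sqrt_gap(x):
    """Return (x^2 - 3x) - (-2*sqrt(x)); the proof needs this to be >= 0 for x >= 0."""
    return x * x - 3.0 * x + 2.0 * math.sqrt(x)

def min_gap(a, b, s):
    """Return a^s * b^(1-s) - min(a, b); the proof needs this to be >= 0 for 0 <= s <= 1."""
    return a ** s * b ** (1.0 - s) - min(a, b)

# Illustrative grids (an arbitrary choice): x, a, b in (0, 5], s in [0, 1].
points = [0.05 * i for i in range(1, 101)]
exponents = [0.0, 0.25, 0.5, 0.75, 1.0]

# Smallest gaps found on the grid; both should be nonnegative up to rounding.
worst_sqrt_gap = min(sqrt_gap(x) for x in [0.0] + points)
worst_min_gap = min(min_gap(a, b, s) for a in points for b in points for s in exponents)
```

The first gap vanishes at $x=0$ and $x=1$, which is where the bound $-2G^{1/2}\le G^2-3G$ is tight on the spectrum of $G$.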



It is natural to introduce the function $\mu(\pi,s)$, similar to the analogous function in classical information theory (e.g. [1], Ch. 5):

\[ \mu(\pi,s) = -\ln\mathrm{Tr}\,S_\pi^{1+s} = -\ln\sum_{j=1}^{d}\lambda_j^{1+s}. \tag{8} \]

Clearly, $\mu(\pi,0)=0$. Using the formulas

\[ \mu'(\pi,s) = -\frac{\mathrm{Tr}\,S_\pi^{1+s}\ln S_\pi}{\mathrm{Tr}\,S_\pi^{1+s}}, \]

\[ \mu''(\pi,s) = \frac{\left(\mathrm{Tr}\,S_\pi^{1+s}\ln S_\pi\right)^2 - \mathrm{Tr}\,S_\pi^{1+s}(\ln S_\pi)^2\cdot\mathrm{Tr}\,S_\pi^{1+s}}{\left(\mathrm{Tr}\,S_\pi^{1+s}\right)^2}, \]

it is easy to check that $\mu(\pi,s)$ is a nondecreasing and $\cap$-convex function of $s$. Moreover,

\[ \mu'(\pi,0) = H(S_\pi). \]

There is a special case where among the signal vectors $\psi_i$, $i=1,\dots,a$, there are $k>1$ mutually orthogonal ones: defining $\pi$ to be the uniform distribution on these vectors, one has $\mu(\pi,s)=s\ln k$. Otherwise the function $\mu(\pi,s)$ is strictly increasing and strictly $\cap$-convex (see Fig. 1).
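The properties of $\mu(\pi,s)$ listed above can be checked numerically once the eigenvalues of $S_\pi$ are known. The Python sketch below takes the spectrum as input; the function names and the sample spectra are hypothetical and serve only as an illustration.

```python
import math

def mu(eigenvalues, s):
    """mu(pi, s) = -ln sum_j lambda_j^(1+s), with lambda_j the eigenvalues of S_pi."""
    return -math.log(sum(x ** (1.0 + s) for x in eigenvalues if x > 0.0))

def mu_prime(eigenvalues, s):
    """Derivative of mu in s: -Tr(S^(1+s) ln S) / Tr(S^(1+s))."""
    positive = [x for x in eigenvalues if x > 0.0]
    weights = [x ** (1.0 + s) for x in positive]
    return -sum(w * math.log(x) for w, x in zip(weights, positive)) / sum(weights)

def entropy(eigenvalues):
    """Quantum entropy -sum_j lambda_j ln(lambda_j)."""
    return -sum(x * math.log(x) for x in eigenvalues if x > 0.0)

spectrum = [0.2, 0.3, 0.5]      # hypothetical eigenvalues of S_pi
orthogonal = [1.0 / 3.0] * 3    # uniform mixture of k = 3 mutually orthogonal states
```

With these definitions, `mu(spectrum, 0.0)` is zero, `mu_prime(spectrum, 0.0)` agrees with `entropy(spectrum)`, and `mu(orthogonal, s)` equals $s\ln 3$, matching the special case described in the text.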

By taking $M=e^{nR}$, we obtain

Corollary 1. For any $0<R<C$ the following lower bound holds:

\[ E(R) \ge \max_\pi\max_{0\le s\le 1}\left(\mu(\pi,s)-sR\right) \equiv E_r(R). \tag{9} \]

Corollary 2.

\[ C \ge \max_\pi H(S_\pi). \tag{10} \]

Proof. Indeed,

\[ C \ge \max_\pi\sup_{0<s\le 1}\frac{\mu(\pi,s)}{s} \ge \max_\pi\mu'(\pi,0) = \max_\pi H(S_\pi). \qquad\square \]

Inequality (10) together with the converse inequality from [4] provides an alternative proof of the formula (3).

The maximization with respect to $s$ can be treated in the same way as in Ch. 5.7 of [1]. Defining the function

\[ E_r(\pi,R) = \max_{0\le s\le 1}\left[\mu(\pi,s)-sR\right], \tag{11} \]

we have

\[ E_r(\pi,R) = \mu(\pi,1)-R \qquad\text{for } 0<R\le\mu'(\pi,1), \]

where

\[ \mu(\pi,1) = -\ln\mathrm{Tr}\,S_\pi^2 = -\ln\sum_{i,k=1}^{a}\pi_i\pi_k|\langle\psi_i|\psi_k\rangle|^2 . \]

For $\mu'(\pi,1)<R<C$ the function $E_r(\pi,R)$ is $\cup$-convex and is given by

\[ E_r(\pi,R) = \mu(\pi,s_R)-s_R R, \]

where $s_R$ is the root of the equation $\mu'(\pi,s_R)=R$ (see Fig. 2).
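The maximization in (11) can be approximated by a simple grid search over $s$. The Python sketch below is one way to do this for a given spectrum of $S_\pi$; the grid search stands in for solving $\mu'(\pi,s_R)=R$ exactly, and the function names, the grid size, and the sample spectrum are assumptions made for illustration.

```python
import math

def mu(eigenvalues, s):
    """mu(pi, s) = -ln sum_j lambda_j^(1+s)."""
    return -math.log(sum(x ** (1.0 + s) for x in eigenvalues if x > 0.0))

def random_coding_exponent(eigenvalues, rate, steps=1000):
    """Approximate E_r(pi, R) = max over 0 <= s <= 1 of [mu(pi, s) - s R] on a grid in s."""
    return max(mu(eigenvalues, i / steps) - (i / steps) * rate for i in range(steps + 1))

spectrum = [0.25, 0.75]   # hypothetical eigenvalues of S_pi
low_rate = random_coding_exponent(spectrum, 0.2)
high_rate = random_coding_exponent(spectrum, 0.6)
```

For this spectrum, the rate 0.2 lies below $\mu'(\pi,1)\approx 0.398$, so the maximum sits at $s=1$ and `low_rate` equals $\mu(\pi,1)-R$. The rate 0.6 exceeds $H(S_\pi)\approx 0.562$, so the maximum over the grid is attained at $s=0$ and `high_rate` is zero.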

3. The expurgated lower bound 

When codewords are chosen at random, there is a certain probability that some codewords coincide, which makes the error probability for such a code equal to 1. It turns out that the probability of choosing such a bad code does not essentially influence the average code error probability if the rate $R$ is relatively high. Conversely, it becomes dominant for low rates $R$. All these effects are well described in [1], Ch. 5.7. In order to reduce the influence of such bad codes, an elegant "expurgation" technique has been developed.

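We start with an ensemble of codes with $M'=2M-1$ codewords and denote by

\[ \lambda_k = \left[1-|\langle\psi_{u^k}|\hat\psi_k\rangle|^2\right] \]

the probability of an erroneous decision for the word $u^k$ when the decision rule $\{X_k\}$ from Sec. 2 is used. Then according to the Lemma from Ch. 5.7 of [1] there exists a code in the ensemble of codes with $M'=2M-1$ codewords for which at least $M$ codewords satisfy

\[ \lambda_k \le \left[2\,\mathsf{E}\lambda_k^{r}\right]^{1/r} \tag{12} \]

for arbitrary $0<r\le 1$ (without loss of generality we can assume that (12) holds for $k=1,\dots,M$). Then we can use an estimate from [?] to evaluate the right-hand side of (12). By using the inequality $\sqrt{\gamma}\ge\frac{3}{2}\gamma-\frac{1}{2}\gamma^2$ for $\gamma\ge 0$, one obtains

\[ \lambda_k \le 2\left[1-\langle\hat\psi_k|G^{1/2}(u^1,\dots,u^{M'})\hat\psi_k\rangle\right] \]
\[ \le 2-3\langle\hat\psi_k|G(u^1,\dots,u^{M'})\hat\psi_k\rangle+\langle\hat\psi_k|G^{2}(u^1,\dots,u^{M'})\hat\psi_k\rangle \]
\[ = \sum_{i\ne k}|\langle\psi_{u^i}|\psi_{u^k}\rangle|^2, \]

where the summation is over $i$ from 1 to $M'$. Here we used $1-t^2\le 2(1-t)$, $\langle\psi_{u^k}|\hat\psi_k\rangle=\langle\hat\psi_k|G^{1/2}\hat\psi_k\rangle$, $\langle\hat\psi_k|G\hat\psi_k\rangle=1$ and $\langle\hat\psi_k|G^{2}\hat\psi_k\rangle=\sum_{i}|\langle\psi_{u^i}|\psi_{u^k}\rangle|^2$.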
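For two unit vectors with a real overlap $\epsilon$, the quantities above can be written in closed form, which gives a quick check of the bound $\lambda_k\le\sum_{i\ne k}|\langle\psi_{u^i}|\psi_{u^k}\rangle|^2$. The Python sketch below assumes just two codewords and uses the identity $\langle\psi_{u^k}|\hat\psi_k\rangle=(\Gamma^{1/2})_{kk}$ for the Gram matrix $\Gamma$; the function names are hypothetical.

```python
import math

def srm_error_two_states(eps):
    """Error probability lambda_k of the decision rule X_k = |hat psi_k><hat psi_k|
    for two unit vectors with real overlap eps, 0 <= eps < 1.

    The Gram matrix [[1, eps], [eps, 1]] has eigenvalues 1 + eps and 1 - eps with
    eigenvectors (1, 1)/sqrt(2) and (1, -1)/sqrt(2), so each diagonal entry of its
    square root equals (sqrt(1 + eps) + sqrt(1 - eps)) / 2.
    """
    diag_sqrt = 0.5 * (math.sqrt(1.0 + eps) + math.sqrt(1.0 - eps))
    return 1.0 - diag_sqrt ** 2

def overlap_bound(eps):
    """Right-hand side of the bound: the sum over i != k has the single term eps^2."""
    return eps ** 2

# Illustrative grid of overlaps.
overlaps = [i / 100.0 for i in range(100)]
bound_holds = all(srm_error_two_states(e) <= overlap_bound(e) + 1e-12 for e in overlaps)
```

In this two-state case the error probability simplifies to $(1-\sqrt{1-\epsilon^2})/2$, which is at most $\epsilon^2/2$, so the bound holds with room to spare.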

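Using the inequality $\left(\sum a_i\right)^r\le\sum a_i^r$, $0<r\le 1$, we get for randomly chosen codewords

\[ \mathsf{E}\lambda_k^{r} \le (M'-1)\,\mathsf{E}|\langle\psi_u|\psi_{u'}\rangle|^{2r} = 2(M-1)\left[\sum_{i,k=1}^{a}\pi_i\pi_k|\langle\psi_i|\psi_k\rangle|^{2r}\right]^n . \]

Substituting this into (12) and denoting $s=1/r$, we obtain

Proposition 2. For all $s\ge 1$

\[ \lambda_{\max}(M,n) \le \left(4(M-1)\right)^s\left[\sum_{i,k=1}^{a}\pi_i\pi_k|\langle\psi_i|\psi_k\rangle|^{2/s}\right]^{sn}. \tag{13} \]

Taking $M=e^{nR}$, we obtain the lower bound with expurgation

\[ E(R) \ge \max_\pi\max_{s\ge 1}\left(\tilde\mu(\pi,s)-s\left(R+\frac{\ln 4}{n}\right)\right) = E_{ex}\left(R+\frac{\ln 4}{n}\right), \]

where

\[ \tilde\mu(\pi,s) = -s\ln\sum_{i,k=1}^{a}\pi_i\pi_k|\langle\psi_i|\psi_k\rangle|^{2/s}. \]

The function $\tilde\mu(\pi,s)$ is $\cap$-convex, increasing from the value

\[ \tilde\mu(\pi,1) = \mu(\pi,1) = -\ln\mathrm{Tr}\,S_\pi^2 \]

for $s=1$ to

\[ \tilde\mu(\pi,\infty) = -\sum_{i,k=1}^{a}\pi_i\pi_k\ln|\langle\psi_i|\psi_k\rangle|^2 \]

(which may be infinite if there are orthogonal states). The behavior of $\tilde\mu(\pi,s)$ in the case where this value is finite is shown in Fig. 1.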
By introducing

\[ E_{ex}(\pi,R) = \max_{s\ge 1}\left[\tilde\mu(\pi,s)-sR\right], \tag{14} \]

we can investigate the behavior of $E_{ex}(\pi,R)$ as in the classical case. Namely, for $0<R\le\tilde\mu'(\pi,1)$, where

\[ \tilde\mu'(\pi,1) = -\ln\mathrm{Tr}\,S_\pi^2 + \frac{\sum_{i,k=1}^{a}\pi_i\pi_k|\langle\psi_i|\psi_k\rangle|^2\ln|\langle\psi_i|\psi_k\rangle|^2}{\mathrm{Tr}\,S_\pi^2} \le \tilde\mu(\pi,1), \]

the function $E_{ex}(\pi,R)$ is $\cup$-convex, decreasing from

\[ E_{ex}(\pi,+0) = \tilde\mu(\pi,\infty) = -\sum_{i,k=1}^{a}\pi_i\pi_k\ln|\langle\psi_i|\psi_k\rangle|^2 \tag{15} \]

to $E_{ex}(\pi,\tilde\mu'(\pi,1))=\tilde\mu(\pi,1)-\tilde\mu'(\pi,1)$. In the interval $\tilde\mu'(\pi,1)\le R\le\tilde\mu(\pi,1)$ it is the linear function

\[ E_{ex}(\pi,R) = \tilde\mu(\pi,1)-R, \]

and $E_{ex}(\pi,R)=0$ for $\tilde\mu(\pi,1)\le R<C$.

Thus, comparing it with $E_r(\pi,R)$, we have in the generic case the picture shown in Fig. 2:

\[ E_r(\pi,R) < E_{ex}(\pi,R), \qquad 0<R<\tilde\mu'(\pi,1); \]
\[ E_r(\pi,R) = E_{ex}(\pi,R), \qquad \tilde\mu'(\pi,1)\le R\le\mu'(\pi,1); \]
\[ E_r(\pi,R) > E_{ex}(\pi,R), \qquad \mu'(\pi,1)<R<C. \]

However, it may happen that $\tilde\mu'(\pi,1)>\mu'(\pi,1)$, in which case the linear piece of the bound is absent.
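The expurgated exponent (14) can be explored numerically in the same spirit as the random coding exponent. The Python sketch below evaluates $\tilde\mu(\pi,s)$ from the moduli of the overlaps and approximates the maximum over $s\ge 1$ on a truncated grid. The function names, the truncation at `s_max`, and the two-state example are assumptions made for illustration.

```python
import math

def mu_tilde(probs, overlaps, s):
    """mu~(pi, s) = -s ln sum_{i,k} pi_i pi_k |<psi_i|psi_k>|^(2/s).

    `overlaps[i][k]` is assumed to hold |<psi_i|psi_k>| (so overlaps[i][i] == 1).
    """
    total = sum(p * q * overlaps[i][k] ** (2.0 / s)
                for i, p in enumerate(probs) for k, q in enumerate(probs))
    return -s * math.log(total)

def expurgated_exponent(probs, overlaps, rate, s_max=50.0, steps=5000):
    """Approximate E_ex(pi, R) = max over s >= 1 of [mu~(pi, s) - s R] on a finite grid.

    Truncating the range at s_max is an assumption of this sketch; it is adequate
    only when the maximizing s lies below s_max.
    """
    grid = (1.0 + (s_max - 1.0) * i / steps for i in range(steps + 1))
    return max(mu_tilde(probs, overlaps, s) - s * rate for s in grid)

# Hypothetical example: two pure states with overlap 0.5, used with equal probability.
probs = [0.5, 0.5]
overlaps = [[1.0, 0.5], [0.5, 1.0]]
linear_piece = expurgated_exponent(probs, overlaps, 0.3)
```

In this example $\tilde\mu'(\pi,1)\approx 0.193$ and $\tilde\mu(\pi,1)\approx 0.470$, so the rate 0.3 falls on the linear piece and `linear_piece` equals $\tilde\mu(\pi,1)-R$. For large $s$ the value of `mu_tilde` approaches $-\ln 0.5$, in agreement with (15).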

The value (15) is in fact exact, as the following proposition shows.

Proposition 3. If $|\langle\psi_i|\psi_k\rangle|>0$ for any $1\le i,k\le a$, then

\[ E(+0) = -\min_\pi\sum_{i,k=1}^{a}\pi_i\pi_k\ln|\langle\psi_i|\psi_k\rangle|^2 . \tag{16} \]

If $|\langle\psi_i|\psi_k\rangle|=0$ for some $i,k$, then $E(+0)=\infty$.

Proof. From (15) we see that $E(+0)$ is greater than or equal to the right-hand side of (16). On the other hand, from testing two hypotheses ([?], p. 130, (2.34)), we have

\[ \lambda_{\max}(\mathcal{C},X) \ge \frac{1}{2}\left[1-\sqrt{1-\max_{u\ne u'}|\langle\psi_u|\psi_{u'}\rangle|^2}\right] \ge \frac{1}{4}\max_{u\ne u'}|\langle\psi_u|\psi_{u'}\rangle|^2, \]

where $u,u'$ are codewords from $\mathcal{C}=(u^1,\dots,u^M)$, and therefore

\[ E(+0) \le -\lim_{n\to\infty}\frac{2}{n}\max_{u\ne u'}\ln|\langle\psi_u|\psi_{u'}\rangle| . \]



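Denote by $\psi_u(k)$ the $k$-th component of the codeblock $u$ and let $k_i=\pi_i M$ be the number of codeblocks $u$ with $\psi_u(k)=\psi_i$, $i=1,\dots,a$. Then we have

\[ \max_{u\ne u'}\ln|\langle\psi_u|\psi_{u'}\rangle| \ge \frac{1}{M(M-1)}\sum_{u,u'\in\mathcal{C}}\ln|\langle\psi_u|\psi_{u'}\rangle| \]
\[ \ge \frac{n}{M(M-1)}\min_{1\le k\le n}\sum_{u,u'\in\mathcal{C}}\ln|\langle\psi_u(k)|\psi_{u'}(k)\rangle| \]
\[ \ge \frac{n}{M(M-1)}\min_{\{k_i\}}\sum_{i=1}^{a}\sum_{j=1}^{a}k_i k_j\ln|\langle\psi_i|\psi_j\rangle| \]
\[ \ge \frac{nM}{M-1}\min_\pi\sum_{i=1}^{a}\sum_{j=1}^{a}\pi_i\pi_j\ln|\langle\psi_i|\psi_j\rangle|, \]

from which it follows that

\[ E(+0) \le -\min_\pi\sum_{i,k=1}^{a}\pi_i\pi_k\ln|\langle\psi_i|\psi_k\rangle|^2 . \]

As a result, we get (16). $\square$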






4. The binary quantum channel 



Maximization of the bounds $E_r(\pi,R)$, $E_{ex}(\pi,R)$ over $\pi$, which is a difficult problem even in the classical case, is still more difficult in the quantum case. However, if the distribution $\pi^0$ maximizing either $\mu(\pi,s)$ or $\tilde\mu(\pi,s)$ is the same for all $s$, then the analysis of Secs. 2, 3 applies to the functions

\[ E_r(R) = E_r(\pi^0,R), \qquad E_{ex}(R) = E_{ex}(\pi^0,R). \]

Let $a=d=2$ and let $|\psi_0\rangle$, $|\psi_1\rangle$ be two pure states with $|\langle\psi_0|\psi_1\rangle|=\epsilon$. Consider the operator $S_\pi=(1-\pi)S_0+\pi S_1$. Its eigenvectors have the form $|\psi_0\rangle+\alpha|\psi_1\rangle$ with some $\alpha$. Therefore for its eigenvalues we get the equation

\[ \left((1-\pi)|\psi_0\rangle\langle\psi_0|+\pi|\psi_1\rangle\langle\psi_1|\right)\left(|\psi_0\rangle+\alpha|\psi_1\rangle\right) = \lambda\left(|\psi_0\rangle+\alpha|\psi_1\rangle\right). \]
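Solving it, we find the eigenvalues

\[ \lambda_1(\pi) = \frac{1}{2}\left[1-\sqrt{1-4(1-\epsilon^2)\pi(1-\pi)}\right], \]
\[ \lambda_2(\pi) = \frac{1}{2}\left[1+\sqrt{1-4(1-\epsilon^2)\pi(1-\pi)}\right]. \]

It is easy to check that both functions

\[ \mu(\pi,s) = -\ln\left(\lambda_1(\pi)^{1+s}+\lambda_2(\pi)^{1+s}\right), \]
\[ \tilde\mu(\pi,s) = -s\ln\left(\pi^2+(1-\pi)^2+2\pi(1-\pi)\epsilon^{2/s}\right) \]

are maximized by $\pi=1/2$. Denoting

\[ \mu(s) = \mu(1/2,s) = -\ln\left[\left(\frac{1-\epsilon}{2}\right)^{1+s}+\left(\frac{1+\epsilon}{2}\right)^{1+s}\right], \]
\[ \tilde\mu(s) = \tilde\mu(1/2,s) = -s\ln\frac{1+\epsilon^{2/s}}{2}, \]

we get the following bound:

\[ E(R) \ge \tilde\mu(\tilde s_R)-\tilde s_R R, \qquad 0<R\le\tilde\mu'(1); \]
\[ E(R) \ge \mu(1)-R, \qquad \tilde\mu'(1)\le R\le\mu'(1); \]
\[ E(R) \ge \mu(s_R)-s_R R, \qquad \mu'(1)\le R<C, \]

where $\tilde s_R$, $s_R$ are the solutions of the equations $\tilde\mu'(\tilde s_R)=R$, $\mu'(s_R)=R$, and

\[ \mu(1)=\tilde\mu(1) = -\ln\left(\frac{1+\epsilon^2}{2}\right), \]
\[ \tilde\mu'(1) = -\ln\left(\frac{1+\epsilon^2}{2}\right)+\frac{\epsilon^2\ln\epsilon^2}{1+\epsilon^2}, \]
\[ \mu'(1) = -\frac{(1-\epsilon)^2\ln\left(\frac{1-\epsilon}{2}\right)+(1+\epsilon)^2\ln\left(\frac{1+\epsilon}{2}\right)}{2(1+\epsilon^2)}, \]
\[ C = \mu'(0) = -\left[\left(\frac{1-\epsilon}{2}\right)\ln\left(\frac{1-\epsilon}{2}\right)+\left(\frac{1+\epsilon}{2}\right)\ln\left(\frac{1+\epsilon}{2}\right)\right]. \]

Moreover, from Proposition 3,

\[ E(+0) = -\ln\epsilon . \]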
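The closed-form expressions for the binary channel lend themselves to a direct numerical check. The Python sketch below implements the eigenvalues, $\mu(\pi,s)$, $\tilde\mu(\pi,s)$ and the capacity for a given overlap $\epsilon$, and searches a grid of values of $\pi$ for the maximizer. The function names, the chosen overlap and the grid are illustrative assumptions.

```python
import math

def eigenvalues(eps, p):
    """Eigenvalues of S_pi = (1 - p) S_0 + p S_1 for two pure states with overlap eps."""
    root = math.sqrt(1.0 - 4.0 * (1.0 - eps ** 2) * p * (1.0 - p))
    return 0.5 * (1.0 - root), 0.5 * (1.0 + root)

def mu(eps, p, s):
    """mu(pi, s) = -ln(lambda_1^(1+s) + lambda_2^(1+s))."""
    lam1, lam2 = eigenvalues(eps, p)
    return -math.log(lam1 ** (1.0 + s) + lam2 ** (1.0 + s))

def mu_tilde(eps, p, s):
    """mu~(pi, s) = -s ln(p^2 + (1-p)^2 + 2 p (1-p) eps^(2/s))."""
    return -s * math.log(p ** 2 + (1.0 - p) ** 2 + 2.0 * p * (1.0 - p) * eps ** (2.0 / s))

def capacity(eps):
    """C = mu'(0) at p = 1/2: the entropy of the spectrum ((1 - eps)/2, (1 + eps)/2)."""
    lo, hi = 0.5 * (1.0 - eps), 0.5 * (1.0 + eps)
    return -(lo * math.log(lo) + hi * math.log(hi))

eps = 0.5                                # hypothetical overlap, 0 < eps < 1
grid = [i / 100.0 for i in range(101)]   # candidate values of pi
best_p_mu = max(grid, key=lambda p: mu(eps, p, 0.5))
best_p_mu_tilde = max(grid, key=lambda p: mu_tilde(eps, p, 2.0))
```

On this grid both maximizers come out as $\pi=1/2$, as stated in the text. One can also confirm that `mu(eps, p, 1.0)` and `mu_tilde(eps, p, 1.0)` coincide for every `p`, and that a finite-difference derivative of `mu(eps, 0.5, s)` at `s = 0` approximates `capacity(eps)`.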

5. Comments on the case of arbitrary signal states 

The case where the signal states are given by commuting density operators $S_i$ reduces to the case of a classical channel with transition probabilities $\lambda_j^i$, where $\lambda_j^i$ are the eigenvalues of $S_i$. The classical bound given by Theorem 5.6.1 of [1] in the case of commuting density operators takes the form



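\[ \mathsf{E}\,\bar\lambda(u^1,\dots,u^M) \le \min_{0\le s\le 1}(M-1)^s\left(\mathrm{Tr}\left[\sum_{i=1}^{a}\pi_i S_i^{\frac{1}{1+s}}\right]^{1+s}\right)^n. \tag{17} \]

The expurgated bound given by Theorem 5.7.1 of [1] in the case of commuting density operators reads

\[ \lambda_{\max}(M,n) \le \min_{s\ge 1}\left(4(M-1)\right)^s\left(\sum_{i,k=1}^{a}\pi_i\pi_k\left(\mathrm{Tr}\sqrt{S_i}\sqrt{S_k}\right)^{\frac{1}{s}}\right)^{sn}. \tag{18} \]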



The right-hand sides of (17), (18) are meaningful for arbitrary density operators, which gives some hope that these estimates could be generalized to the noncommutative case with minor modifications.